Conversation

zrr1999 (Member) commented Jan 4, 2026

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

Templatize FractionalEndIndex, FractionalStartIndex, and FractionalRationalU so that callers passing int64 values no longer risk numeric overflow, without requiring changes to any other code.
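To illustrate the idea, here is a minimal sketch of what templating an index helper on its integer type can look like. The names `FractionalStartIndexSketch` and `FractionalEndIndexSketch`, their signatures, and the formula are hypothetical and chosen for illustration only; they are not the actual Paddle definitions touched by this PR.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Hypothetical helpers (not Paddle's real code). The index type is a template
// parameter, so a caller that already works with int64_t gets an int64_t
// result instead of a value narrowed to int.
template <typename IndexT>
inline IndexT FractionalStartIndexSketch(IndexT idx, double alpha, double u) {
  // Do the arithmetic in double, then convert back to the caller's index type.
  return static_cast<IndexT>(
      std::ceil(alpha * (static_cast<double>(idx) + u) - 1));
}

template <typename IndexT>
inline IndexT FractionalEndIndexSketch(IndexT idx, double alpha, double u) {
  return static_cast<IndexT>(
      std::ceil(alpha * (static_cast<double>(idx) + 1 + u) - 1));
}
```

Existing call sites that pass `int` keep compiling unchanged because the index type is deduced from the argument, while large-tensor call sites can pass `int64_t` and receive a result that does not wrap around past `INT_MAX`.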

pcard-93269

paddle-bot (bot) commented Jan 4, 2026

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

zrr1999 changed the title from "[CodeStyle][large-tensor]" to "[CodeStyle][large-tensor] template some public functions" Jan 5, 2026
zrr1999 changed the title from "[CodeStyle][large-tensor] template some public functions" to "[CodeStyle][large-tensor]template some public functions" Jan 5, 2026
zrr1999 changed the title from "[CodeStyle][large-tensor]template some public functions" to "[CodeStyle][large-tensor] template some public functions" Jan 5, 2026
wanghuancoder (Contributor) left a comment

LGTM

wanghuancoder merged commit e73960c into PaddlePaddle:develop Jan 8, 2026
73 of 78 checks passed
zrr1999 added a commit to zrr1999/Paddle that referenced this pull request Jan 8, 2026
zrr1999 added a commit to zrr1999/Paddle that referenced this pull request Jan 15, 2026
sneaxiy pushed a commit that referenced this pull request Jan 15, 2026
* add _C_ops.rms_norm (#76930)

* mv fastln to ln

* fix build error on win and hip

* fix build error for infer

* support backward

* fix error

* merge fast_rms_norm and fused_rms_norm_ext to rms_norm

* fix CMake error

* remove windows version

* add rms_norm for xpu

* fix cmake

* fix cmake2

* fix test_rms_norm

* rms_norm align torch

* fix test_rms_norm_xpu

* remove fast_rms_norm in rms_norm

* rename rms_norm_nzs to rms_norm

* fix

* fix RmsNormInferMeta

* mv fused_rms_norm_quant from ops.yaml to fused_ops.yaml

* revert mv fused_rms_norm_quant from ops.yaml to fused_ops.yaml

* fix rms_norm docs

* fix doctest

* fix doctest

* fix test_rms_norm

* fix test cmakefile

* fix test_rms_norm

* add InferSymbolicShapeInterface

* add check_pir

---------

Co-authored-by: DanielSun11 <[email protected]>

* fix large tensor issues (#77154)

* [CodeStyle][large-tensor] fix some clang-tidy narrowing conversion errors (#77183)

* update

* update 2

* [Precision Depth Alignment] align paddle.lerp forward and backward (#77149)

* fix

* align lerp to torch

* fix cpu

* fix ce

* Optimize the Cuda Kernel performance of Paddle rms_norm (#77098)

* accuracy and Torch alignment

* support rms_norm behavior to be the same as torch

* fix rms_norm_xpu_kernel

* add valueError_test

* Revert "add valueError_test"

This reverts commit ccaaa1b.

* Reapply "add valueError_test"

This reverts commit 19513e8.

* optimize performance

* add vectorization

* fix

* fix dtype of normalized_shape

* fix pooling (#77191)

* fix stridedSliceRawGradStrided (#77244)

* fix backward of rms_norm (#77274)

---------

Co-authored-by: zhengshengning <[email protected]>
Co-authored-by: DanielSun11 <[email protected]>